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Abstract 

We study financial distributions within the framework of the continuous time ran- 
dom walk (CTRW). We review earlier approaches and present new results related 
to overnight effects as well as the generalization of the formalism which embodies 
a non-Markovian formulation of the CTRW aimed to account for correlated incre- 
ments of the return. 



1 Introduction 

The continuous time random walk (CTRW), first introduced by Montroll and 
Weiss in 1965 pQ, constitutes a powerful tool for studying the microstructure 
of many random process appearing in a large variety of physical phenom- 
ena. These include transport in disordered media [2|3] , random networks [I], 
self-organized criticality [5], electron tunneling [5], earthquake modeling [7115] , 
hydrology [91TT0] and time-series analysis pT|T2] . 

Physicists have recently provided some examples of CTRW's applied to fi- 
nance. Thus the papers by Scalas et al [13] were among the first works ad- 
dressed to this issue. Further developments have been given in [T^yTo] (see [IB] 
for a short review of these works) along with the work of Kutner and Switala 

HZ). 

In recent papers ^8lfl9] we have also developed, in an independent and slightly 
different way, the formalism of the CTRW designed to study financial data in 
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the so called "high frequency regime" , that is, data obtained from tick-by-tick 
prices. Based on the assumption of independent and identically distributed 
events, it is possible to get a general expression for the probability distribution 
of prices at time t in terms of two auxiliary densities that can be estimated from 
data: the probability density of the pausing time between ticks, ip(t), and the 
corresponding density for the magnitude of the price increment at a given tick, 
h(x) (see Eqs. ([2]) and ([3]) below for a formal definitions of these quantities). 
However, with the specific formalism developed in [18J, and specially [19], one 
can only get intraday or at most probabilities at the end of a given session, i. e., 
we have obtained, based on tick-by-tick data, the probability density function 
for the price within a trading day. This can be a limitation of the model, since 
in some financial applications one is interested in longer distribution of prices. 

In this paper we want to extend some of these developments. In this sense we 
will present a more general model for the correlation between the densities 
if)(t) and h(x) that is based on the theory of copulas [20]. We will also present 
a method to include the so-called "overnight effect" . Finally, we introduce the 
general equations for a non-Markovian CTRW which takes into the previous 
history. 

The paper is organized as follows. In Sect. [2] we present a summary of the 
existent general formalism, and its asymptotic approximations, as applied to 
financial problems. In Sect. [3] we treat the volatility and outline how to treat 
"inverse problems" , i.e., those of obtaining high-frequency quantities out of the 
knowledge of low-frequency statistics. In Sect.HJwe developed some extensions 
and generalization of the CTRW which include overnight and memory effects 
in the description of prices. Conclusions are drawn in Sect. and technical 
details are left to an appendix. 



2 CTRW: an overview 

We present an outline of the CTRW formalism as developed in the framework 
of financial problems. We refer the reader to Refs. [TS] and [H] for comple- 
mentary aspects of the subject. 



2. 1 Fundamentals of the Formalism 

The central object of our study is the zero-mean return X(t) defined by 

X(t) = \n[S(t + t )/S(t )] - (\n[S(t + t Q )/S(t )]), (1) 
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where S(t) is a speculative price, or the value of an index, and to is an ini- 
tial time. In what follows we will assume stationarity. In this case X(t) is 
independent of t . 

We now suppose that X(t) can be described in terms of a CTRW. In this pic- 
ture X(t) changes at random times • • • , t-i, to, ti,t2, • ■ ■ ,t n , • • ■ and we will first 
assume that the intervals between successive steps, which we call sojourns or 
waiting times, r n = t n — t n -i are independent, identically distributed random 
variables (in a latter section we will relax this assumption) with a probability 
density function, ip(t), defined by 

i/j(t)dt = Prob{t < r n < t + dt}. (2) 

At the conclusion of a given sojourn the zero- mean return X(t) undergoes a 
random change equal to 

AX n (r n ) = X(t n ) - X{t n ^) 
whose probability density function is defined by 

h(x)dx = Prob{x < AX n <x + dx}. (3) 



We now define a function p(x,t) such that p(x,t)dxdt is the joint probability 
that an increment in return is added whose magnitude, AX n , is between x 
and x + dx and that the time interval r n between successive jumps is between 
t and t + dt, i.e., 

p(x, t)dxdt = Prob{x < AX n < x + dx; t < r n < t + dt}. (4) 



We assume that p(x, t) is an even function of x which ensures the absence of 
drift. Two marginal densities can be formed out of p(x,t). The pausing-time 
density ip(t) and the density for a random jump h(x). These are related to 

p(x,t) by 

oo oo 

ip(t) = J p(x,t)dx, h(x) — J p{x,t)dt. (5) 

-oo 



Note that a direct consequence of the unbiased assumption, i.e.: p(—x,t) = 
p(x,t), is that h(x) is also symmetric around the origin 

h(-x) = h(x), (6) 



so that its odd moments are equal to zero. 
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Fig. 1. Schematic representation of the return process. The dots mark the value 
X(t n ) of the return after each sojourn. T n = t n — t n -\ is the time increment of the 
n-th sojourn. 

As to the trajectory of the random process, we assume that X(t) consists of a 
series of step functions (see Fig. [T]). In other words, the return evolves discon- 
tinuously and during any sojourn the value of the return remains constant. 

The value of the return X(t) at time t will be given by the random value of the 
height at t (see Fig. [Tj). Our goal is to obtain the probability density function 
of X{t). This function is the propagator, defined by 

p(x, t)dx = Prob{x < X(t) < x + dx}. 

From this definition and Fig. [1] we see that the propagator prior to the first 
jump, which we denote by po(x,t), is equal to 



p {x,t) = *{t)6(x), 



(7) 



where we have assumed that the initial jump occurred at t = 0, and \l/(t) is 
the probability that the time between two ticks is t or greater, i.e., 



W(t) = J ^(t')dt'. 
t 
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In terms of po and p we have that the pdf for the return at time t, p(x,t), 
satisfies the following renewal equation: 



t oo 

p(x, t) = po(x, t) + / dt' I p{x ,t')p(x — x ,t — t')dx . (9) 



-oo 



This equation is derived from the consideration that at time t, the process is 
either within the very first sojourn, this given by the first term on the rhs of 
Eq. 0, or else the first sojourn ended at time t' < t, at which time the return 
had value x', and from (x',t') the process is regenerated. 

Equation (Q can be solved in terms of the joint Fourier-Laplace transform 

oo oo 

p(w,s)= [e~ st dt I e iuJx p(x,t)dx. 

The solution is 



o 



P(U,S) = - r, 10 

l-p{u,s) 



where ^0(^5 s ) an d p(o;, s) are respectively the joint Fourier-Laplace transforms 
of the functions p (x,t) and p(x,t). We can easily see from Eqs. © and (JH]) 
that the explicit form of po(u, s) is given by 

p (u,s) = (11) 



where ip(s) is the Laplace transform of the pausing-time density ip(t). Equa- 
tions f|T0|) - f|TT]) furnish a complete solution in the transform domain and is a 
convenient starting point for numerical methods when the inversion cannot be 
carried out in closed form. 



2.2 Some models 



Unfortunately the form of p(x, t) appearing in the general solution fflUj) cannot 
easily be determined from the available data. More easily accessible are the 
marginal pdf's h(x) and ip(t). It is therefore essential to assume a functional 
relation between p(x,t) and its marginal densities ip(t) and h(x). The simplest 
choice would be based on the assumption that return increments and their 
duration time are independent random variables [T3]. In this case 

p(x,t) = h(x)m, (12) 
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so that Eq. (ITU1) becomes 



1 - 



(13) 



However, in some situations one expects some degree of correlation between 
return increments and their duration, while Eq. (112j) implies complete inde- 
pendence between increments and sojourn times. Following that intuition we 
have presented in [T8lfT9] two different functional forms for the joint density 
which take into account the intuitively plausible argument that one must wait 
for a long time in order for a large variation of return to occur. In the Fourier 
and Laplace space, these forms are respectively given by 



p(u, s) = h(u)~ij) sh(u 



(14) 



and 

p(u,s) = 4>(s)~h[u4>(s)]. (15) 

That for both models large increments may be infrequent it is clearly seen by 
the fact that in each case return variations and waiting times are positively 
correlated. Indeed, let r = t n — £ n _i and AX 2 be respectively the sojourn time 
and the return quadratic increment. We define the correlation between these 
two random variables by the quantity 

r = (AI 2 r)-(AI 2 )(r). (16) 
The cross average (AX 2 r) can be expressed in terms of p(u, s) by 

(17) 

uj=s=0 



(AX 2 r) 



6» 3 p(, 



0J,S 



duo 2 ds 



which, after the use of ansatz in Eqs. ( fT4l) and assuming symmetrical return 
increments: (AX) = 0, yields (AX 2 t) = 2(AX 2 )(t). Hence r = (AX 2 )(t) > 
0. 

For the ansatz in Eq. ( fl5l) an analogous calculation gives for the cross cor- 
relation (AX 2 t) = 3(AX 2 )(t), whence r = 2(AX 2 )(r) > 0. Therefore, for 
both models AX 2 and r have a positive correlation, and increasing return 
variations imply increasing waiting times and vice versa. 

We now concentrate on a third model for the correlation between jumps and 
waiting times which is borrowed from the theory of copulas: a rather modern 
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development in mathematical statistics which studies probability distributions 
with fixed marginals [20]. In this context, copulas are defined as functions 
that join (that is, "couple") multivariate distribution functions to their one- 
dimensional distribution functions 0- Specifically, our model is provided by 
the so-called "Clayton copula" in which the joint density p(u, s) is given by 

Em 



-l/a 



where a is an arbitrary real number. For this model one can easily show that 
the correlation defined in Eq. (I16p is given by 

r = a(AX 2 )(r). 

Therefore any positive correlation between waiting times and jumps will cor- 
respond to a > 0. We should remark at this point that one of the main 
advantages of the Clayton copula (jl~8l) over our previous models is precisely 
given by the parameter a since it governs the size of the correlation between 
waiting times and jumps while in (1141) and (1151) this correlation is fixed and 
solely determined by (AX 2 ) and (r). Moreover, a gives an extra degree of 
freedom which can be useful in the modeling of empirical data. 

The model ffl8l) together with Eq. fTTTl) leads us to write the formal solution 
to the problem given by Eq. fflOl) in the following more explicit form: 

p( w » s ) = — r~ — [ — ^ tt7^- ( 19 ) 

1 - h~ a (uj) +i[)- a (s) - 1 



Let us present an explicit example in which it is possible to evaluate p(x, t) 
directly. Suppose that a = 1 and that waiting times and jumps are both 
exponentially distributed. That is 

m = \e~ xt , (20) 



and 



h(x) 



7 - 
2 6 



'7M 



(21) 



where 7 > is such that (AX 2 ) = 2/7 2 is the jump variance. In such a case 
we show in the Appendix |A] that the return pdf is given by the following 

1 In this sense the models defined by Eqs. (|14|) and (|15|) can also be termed as 
copulas. 
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Fig. 2. The return pdf in dimensionless units p("fx, At) /j given in Eq. (I22p for x > 0, 
i.e., without the delta function term appearing in (|22p . By the symmetry in x, p(x, t) 
behaves similarly for negative returns. 

expression 



In Fig. [2] we plot this pdf for positive returns x > 0. 

We finally remark that models given by Eqs. ( fl2~l) . ( JT4l) . ( |T5l) and ( fT8|) are just 
examples of how to obtain the return pdf, p(x,t), once we have guessed from 
data plausible forms for ip(t) and h(x). This is one aspect of the so-called 
"direct problem" and it is certainly the case when we deal with tick-by-tick 
data and want to obtain an expression for the return pdf at any time. We refer 
the reader to refs. [18]- [19] for explicit examples based on actual financial data. 

2.3 Asymptotic analysis 

In the Appendix [B] we show that an asymptotic expression as t — > oo of the 
distribution exemplified by its characteristic function p(u,t) is 




(22) 





(23) 
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We observe that this asymptotic expression is totally general regardless of 
the correlation between sojourn times r and return increments AX, requiring 
only that the mean sojourn time (r) be finite. Obviously for an usable form 
of this asymptotic expression we will need to know the correlation between r 
and AX. In other words, we will have to specify a form for p(x, t) since the 
correlation appearing in Eq. (123]) is related to p by 



(re twAX ) 



(24) 



Thus, for instance, making use of ansatz in Eq. (fl8|) we have 

(re iujAX ) = (r)h l+a (uj) 

so that 



p{uj,t) 



h 1+a (u 



1 f \l-h(cu)]t) 



{r)h 1+a (to) 



(25) 



Now assume that h(x) has a finite second moment pi so that h(uj) ~ 1— p2Uj 2 /2 
as u — > 0. In this case, for values of uj sufficiently small, Eq. (1231) yields the 
Gaussian density 

p(u,t) ~e-^ 2/2{r) } (26) 



which is equivalent to the central limit theorem result. 

On the other hand for a long-tailed jump density for which h(u) ~ 1 — k\uj\ u as 
uj — ► (note that if < v < 2, h(x) has infinite variance) and assuming that 
for ou small (re lujAX ) ~ (r), we obtain from Eq. (123!) the Levy distribution 

p(u,t) ~ e ~ fcM ^ /<r) . (27) 



The Levy distribution has been proposed by several authors as a good can- 
didate for describing financial distributions when the observed fat tails are 
not Gaussian [2T|22 .23j. However, from the discussion above we see that in 
the framework of CTRW's the appearance of Levy distributions is linked to 
the existence of transactions for which the jumps have infinite variance. Since 
this cannot occur in real markets we conclude that the Levy distribution is an 
unsuitable candidate for the propagator. 

We finally show that the tails of the return pdf p(x, t) are the same as that of 
the jump pdf h(x). Indeed, as is well known [3] the tails of p(x, t) as \x\ — > oo 
are determined by the behavior of its characteristic function as uj — > 0. Thus 
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suppose that u is small so that (re lujAX ) ~ (r) and t is moderate in the sense 
that the approximation given in Eq. fl23l) is still valid but in such a way that 
[1 - h(u)]t/(r) < 1. Then from Eq. ([23]) we see that 

p(LJ,t)~-l[l-h(L>)]t/(T), (w-0), 

which after Fourier inversion and neglecting delta function terms yields 

p(x,t) ~ -r^h(x), (|x|->oo). (28) 



Therefore, the analytic form of the tails of p(x,t) as a function of x coincides 
with that of h(x). 



3 Volatility and inverse problems 



Aside from the pdf p(x,t), which provides maximal information about the 
evolution of X(t), there is another quantity of considerable practical interest: 
the variance of X(t). In our analysis this quantity, called "volatility" in the 
terminology of finance, has the advantage that it does not require the knowl- 
edge of the entire jump distribution h(x). It suffices to know the pdf ip(t) and 
the first two moments of h(x). 

Let (X n (t)) be the n-th moment of the return process: 

oo 

(X n {t)) = J x n p{x,t)dx, 

— oo 

and let us denote by rh n {s) its Laplace transform 

oo 

rh n (s) = ( e- st (X n (t))dt. 



o 



This can be written in terms of the joint Fourier-Laplace transform of p(x, t) 
by 



m n (s) = i 



duo n 



(29) 

uj=0 



Recall that a direct consequence of the unbiased assumption expressed in Eq. 
<Q is that the odd moments of h(x) are equal to zero. Let /i n be the nth 
moment of the jump density. Then the 2nth moment can be written in terms 
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of the 2nth derivative of the characteristic function as fx 2n = (—l) n h^ 2n \0). 
All of this implies that the odd moments of the return process vanish: 



(X^\t)) = 0, (n = l,2,3,...). 

The combination of Eqs. ffTU]) and (T291 leads to the relation 

. . , R 2 (s) 



(30) 



m 2 {s) 
where 

£2 00 = 



s[l-^(s)]' 



^ P(w, s) 



(31) 



&u 2 



(32) 



oj=0 



For the Clayton copula defined by Eq. ([18]) we see that 

R 2 (s)=^ 1+a (s), 
where = (AX 2 ) is the second moment of the jump density h(x). Hence 



m 2 (s) = fa 



V> 1+a (s) 
s[l-j>(s)]' 



(33) 



For Poissonian sojourns this equation yields 

/i 2 A 1+Q 



m 2 [s 



s 2 (\ + s) a, 
which after Laplace inversion reads [21] 



l> 2 



(\t) 1+a F(a,a + 2,-\t) 



T(2 + a] 



(34) 



(a > —2), where F(a,c,x) is the Kummer function. Taking into account the 
following limiting behaviors of the Kummer function [23] 



and 

we have 

(X 2 (t)) 



F(a,a + 1, -At) 



F(a, a + 1, -At) ~ 1 (t -> 0) 
T(a + 2) 



/*2 



T(2 + a) 



(At) 



1+0 



r(2) 

(At < 1) 



(At)~ a (t -> 00), 



(35) 
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and 



(X 2 (t)) ~/i 2 At (At»l). 



(36) 



We observe an anomalous diffusion behavior at short times and a diffusion 
behavior at long times. 

In fact one can easily show that the asymptotic volatility grows linearly with 
time regardless the specific form of p(x, t) selected. Indeed, since in all realistic 
models pi and (r) are finite, we see from Eq. (123!) that 



Therefore the asymptotic return volatility increases proportional to t indepen- 
dent of the model chosen since low-order moments are supposed to be finite 
in realistic models. Again this is a consequence of the central limit theorem. 

3.1 Inverse problems 

In the above development we have determined the return density p(x, t) after 
knowing the densities ip{t) and h(x), we can call this "the direct problem'". 
However, from the CTRW formalism it is possible to infer the mechanism of 
price formation at a microscopic level by obtaining the statistics of tick-by-tick 
data represented by the pausing-time density ip(t) and the return increment 
density h(x) once we know the volatility (X 2 (t)) and the return pdf p(x,t). 
We term this "the inverse problem" . Obviously this problem will require the 
knowledge of the complete pdf p(x, t) for all times t, something that can be 
beyond our reach, since in practice one may only have access, for instance, to 
daily data from which one is able to guess the pdf for the daily return p(x, t), 
where t — 1 day. In such a case we do not have an analytical expression for 
p(x,t) which would allow us to get the microscopic densities ip(t) and h(x) 
exactly. Nevertheless with the formalism presented below we will be able to 
guess which forms of ip(t) and h(x) are consistent with the observed p(x,t). 

We will now outline the solution of the inverse problem and refer the reader to 
Ref. [19] for complementary material on this subject which include examples 
based in actual cases. 

Assuming for the joint density p(x, t) the functional form given by the Clayton 
copula, Eq. (Tl8l) . the formulation to the problem given by Eq. (ITQl provides 
a relation between the Fourier and Laplace transforms of the daily return 
density p(x, t) and tick-by-tick densities ij)(t) and h(x). Obviously a second 



(X 2 (t)) ~ t (t -> oo). 



(37) 
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relation is required to determine these microscopic densities, 
by the volatility. Indeed, we write Eq. (133]) as 



This is supplied 



'-^) + f^^( s )-^)=0 > 



/'2 



/'2 



which is an algebraic transcendental equation for the now unknown function 
i(j(s). The simplest situation corresponds to a = —1, for in such Eq. 
(138]) gives at once 



^(s) = 1 



f'2 



sm 2 (s) 



(39) 



Other cases in which a closed solution is readily obtained correspond to a = 
—1/2 and a — 1. Let us briefly comment the latter case. When a = 1 Eq. fl38l) 
reduces to a second degree equation, whose solution reads 



sm 2 {s) 



4fi 2 



sm 2 {s) 



- 1 



(40) 



As an example let us see for which density ip(t) corresponds to an anomalous 
diffusion volatility of the form 



(X 2 {t))=kf 1 (Jfe,7>0). 



(41) 



In this case 



and from Eq. (140]) we have 



m 2 {s) 



M\l + 7 ) 



,1+7 



i>{8) 



b -7 
-S ' 

2 



4st 



(42) 



where 



& = AT(l + 7)/// 2 , 



(43) 



and r(x) is the Gamma function. We proceed to the numerical inversion of 
Eq. (142]) by using the Stehfest algorithm [191127] . In Fig. [3] we plot the density 
if)(t) thus obtained for several values of the exponent 7. 
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Fig. 3. The inverse Laplace transform of the pausing time distribution (142ft for 
7 = 1/2,1,3/2 and with time measured in units of (b/2) 1 /~ / t. We present in the 
inset the log-log representation of ip(t). 

We can also obtain, by using Tauberian theorems, the long and short time 
behaviors of ip(t). In effect, for small values of s, Eq. (|4~2|) yields 

^(s)~l-s 7 /6 (s<6 1/7 ) 

and the Tauberian theorems imply that 

m ^^n r i-, (t» r V7). (44 ) 

On the other hand, for large values of s, we have 

$(s) ~ b 1 '^-' 12 (s > b lh ). 



Hence 



W! 2 

^^n^lf (*«^ 1/7 )- (45) 



We thus have a power-law pausing density at short and long times. 

Let us finish this section by pointing out that once we have an estimate of 
ip{t), we can address the problem of obtaining the pdf of the return increment 
h(x). Note that setting s = in Eq. (TIP]) and taking into account that (c/Eq. 
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(HIl))po(w,0) = (T),weget 



p(cu,0) 



(T) 



(46) 



l-h(w)' 



so that 



h(u) = 1 



(T) 



(47) 



p(w,0)" 



Since 



oo 



p(u,0) = / p(u,t)dt, 







Eq. (|47p allows us to get the characteristic function corresponding to the 
jump density h(x) if we know the entire return pdf p(x,t) for all t > 0. 
In consequence Eq. (1471) is rather difficult to implement since, as we have 
mentioned, in many practical situations one only knows at most a numerical 
series for the value of the propagator instead of an analytic expression of 
p(x, t). In these cases Eq. (I4"7j) is useless for evaluating h(x). However, Eq. 
( 1471) can be very useful for testing any statistical hypothesis, and eventually 
evaluating unknown parameters, on the form of h(x). 



4 Some generalizations 

Up to this point we have developed and applied the CTRW along the lines 
set up in our previous works [THUS]. We will now generalize and extend the 
formalism in two different ways. The first one include the so-called "overnight 
effect" which will allow to extend the formalism to periods of time greater 
than one day. The second generalization deals with the general equations for 
the CTRW when non-Markovian effects, due to past correlations, are taken 
into account. 



4-1 The overnight effect 

At first sight, the time t appearing in the return density p(x,t) solution of 
the renewal equation Eq. (jUJ), it can be any time interval. However, t = 1 
day is, in practice, the upper bound to this time. This is so because the 
existing formalism does not take into account the overnight effects which cause 
that the closing price of one day does not usually coincide with the opening 
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price of the following day o Some observable consequences of the overnight 
effect on return distributions have been adressed in the literature 28 21)11301 . 
We, therefore, extend the CTRW by including the overnight effects into the 
formalism after assuming them to be of a random character. 

Let Sj^l n and S^cL ^ e respectively the opening and closing price of day n. 
Then the zero-mean return at the end of the session of this day is (see Eq. 

©) 

Z n = m [<5 c lose/S'open] ~ ( m [S close /S'open])> 

and the zero-mean return due to the overnight effect is 

Y n = lnfS'op^ V^closel — (M^opcm V^closc] ) ' 

Therefore, the detrended return at the beginig of day iV + 1 is given by the 
sum 

N 

X N = Y,(Z n + Y n ). (48) 

n=l 



Suppose now that both Z n and Y n are independent and identical distributed 
random variables with probability density functions given respectively by p(x) 
and (f(x), i.e., 

p(x)dx = Pr{x < Z n < x + dx}, <p(x)dx = Pr{x < Y n < x + dx}; 

and let Pn{x) be the probability density function of the return at the beginning 
of day N + h 

P^(x)dx = Pr{x < X N < x + dx}. 
If Pn{uj), P(lu), and <p(uj) denote their corresponding characteristic functions, 
then Eq. (|48p and the assumption of independent events lead to 

P*H = ^f- (49) 



This equation determines the probability distribution of the return at the be- 
ginning of day iV+1 if one knows the distributions p(x) and <p(x). In a practical 
situation the density for an overnight transition, <^(x), has to be guessed from 
data. Nevertheless, p(x) can be obtained from the CTRW formalism. Indeed, 
following the notation of the previous sections, let p(x, t) be the return pdf at 
time t (recall that p(x,t) is the solution of the renewal equation (jUj)). Then 

p(x) = p(x,t s ) 

2 In our previous work [19], in order to have enough statistics from the only time 
series of prices of a given market, we have artificially forced the closing price of day 
n to be equal to the opening price of day n + 1, in other words, we have neglected 
the overnight effects. 
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where t s corresponds to the length of one session. Since for most markets the 
average time between successive transaction (r) is about several seconds and 
the duration of a session t s is usually of some hours (6.5 hours for the American 
market), we see that t s 3> (t). Therefore we can apply the asymptotic results 
discussed in Sect. [21 specifically Eq. f l23l . with the result 

pW^-p)- 1 ^) ('■»«). (50) 



where (re lujAX ) is given by Eq. 



Equation (1491) along with Eq. fl50l) and a guessed form of the overnight jump 
density f(x) provide a general approximate expression for the probability 
distribution of returns after N days (N — 1, 2, 3, • • •). 

Let us finish this section by giving a simple expression for the volatility (X^) 
after N days. In terms of the characteristic function Pn{uj) this quantity is 
given by 

/ y2 \ d 2 P N (oo) 
\ a n) - ■ 



duo 2 



uj=0 



From Eq. ffj9|) and taking into account the assumed symmetry of return jumps, 
either intraday and overnight, we get 



q2 

(Xl} = -N — \p(u,t s )+p(u)} 



uj=0 



Let us denote by z/2 the overnight mean-square return jump: 



1/2 = 2/ x 2 ip(x)dx = —if (0). 



On the other hand, from Eq. (loPl) we have (see also Eq. (jSTJO 



d 2 p(uj,t s 
duj 2 



uj=0 



"<r> 



Hence 



+ V 2 



(51) 



where t s = 6.5 hours. Therefore, (X^) grows linearly with N. In other words, 
the iV-day volatility (N = 1, 2, 3, ■ ■ •) has a normal diffusion behavior [3T] . 



17 



4-2 A non-Markovian model for the CTRW 



The entire formalism of the CTRW outlined in the preceding sections relies 
on the fundamental assumption that both waiting times r n = t n — t n -\ and 
jump increments AX n = X(t n ) — X{t n -\) are independent and identically 
distributed random variables. However the hypothesis of independence may 
be questionable. This is specially certain for jump increments in which, in 
studying certain phenomena related to extreme times [32J, there seems to be a 
degree of dependence between successive jump returns. We will thus generalize 
our formalism in order to include possible memory effects. 

To incorporate these effects the simplest generalization consists in assuming 
that the joint pdf for return increments and waiting times p(x, t) as defined 
in cf Eq. (J4]) is replaced by the following Markovian density 



p{x, t\x' , t')dxdt = Probjx < AX n < x + dx; 

t<r n <t + dt\AX n _ x = x'; r n _ x = t'}. (52) 

We remark that in this case the whole process X(t) is always not Markovian 
because the return pdf p(x, t\ A.xq, At ) depends on both the magnitude of the 
previous jump Axo = X(to) — X(t-i) and its sojourn time At = to — t_i; in 
other words, the return distribution at time t depends on two previous times 
t and t_! (t < t < 

Now the integral equation governing the evolution of the return pdf is given 
by the renewal equation 

p(x,t\Ax , At Q ) =po(x,t\Axo,At ) 

oo t 

+ J d yj4T P{V ,T\^M*-y,t-T\«,T), (53) 

-oo 

where 

p (x, t\Ax , At ) = 6(x)*(t\Ax , At ), (54) 



and 



-$(t\Ax , At ) = J dr J p(y,T\Axo,At )dy. 



(55) 



t — oo 



Note that for the case of independent increments discussed in the previous 
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sections we have 

p(y,r\Ax ,At ) = p(y,r), 
and Eq. ([53} equals Eq. ©. 

The level of complexity of the above formalism can be somewhat decreased 
when correlations between consecutive waiting times are negligible. A situation 
that appears in the study of some financial time series [32J. In this case instead 
of Eq. fl52l) we write 

p(x, t\x')dxdt = Probja; < AX n < x + dx; t < r n <t + dt\ AX n _i = x'}, 

and the renewal equation for the distribution of X(t) reduces to 

oo t 

p(x,t\Ax ) = p (x,t\Ax ) + J dy J dTp(y,r\Ax )p(x - y,t - r\y). (56) 

-oo 

This integral equation can be written in a simpler form using the joint Fourier- 
Laplace transform: 

oo oo 

p(u,s\Ax ) = J e~ st dt J e iuJx p(x,t\Ax )dx. 

-oo 

We have 

oo 

p(uj,s\Ax ) = ty(s\Ax )+ J e lujy p(y,s\Ax )p(uj,s\y)dy, (57) 



where p(y, s\ Axo) is the Laplace transform of the conditional joint density. We 
thus see that in this non-Markovian case the Fourier-Laplace transform of the 
return pdf satisfies an integral equation and, without further assumptions and 
simplifications, we cannot get a closed expression for it. This contrasts with 
the Markovian case discussed in Sec. [2] in which we got a closed and general 
expression for p(u, s) given by Eq. ffTO]) . 

We finish this section by presenting the integral equation to be satisfied by 
the volatility. Let 

oo 

st I vn i 



m n (s\Ax ) = J e~ st (X n (t\Ax ))dt 
o 

be the Laplace transform of the conditional n-th moment of the return process: 

oo 

(X n {t\Ax ))= f x n p{x,t\Ax )dx. 
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Hence (see Eq. (129]) ) 



rh n (s\Ax ) = i 



—n 



d n p(u, s|Ax ) 



(58) 



From Eqs. (I5T1) and fl58|) we readily get 



m n (s|Ax ) = 51 / 2/" fc p(2/, s|Ax )m fc (s|?/)rfy 




(59) 



which constitutes a recursive set of integral equations for m n (s|Axo) (n = 
1, 2, 3, • • •). If we extend to this non-Markovian case the unbiased assumption 
for the joint density p: 

p(x,t\Ax ) = p(-x,t\Ax ), 
one can easily prove that the odd conditional moments vanish, 

(X 2n -\t\Ax )) = 0, 
and the Laplace transform of even moments satisfy 



(n = 1, 2, 3, • • •). In particular for n = 1 we see that the Laplace transform of 
the (conditional) volatility obeys the integral equation 




(60) 



oo 




(61) 



—oo 



where 



oo 




(62) 



— oo 



is the second moment of the return increments. 
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5 Conclusions 



We have presented a comprehensive review of the CTRW formalism applied 
to financial systems. This technique is specially suited to deal with high- 
frequency data and allows the treatment of both direct problems and inverse 
problems. We recall that by a direct problem we understand obtaining the 
"macroscopic" pdf, p(x, t), and volatility, (X 2 (t)), from the "microscopic" den- 
sities ip(t) and h(x), while by and inverse problem the situation is reversed, 
that is, one has to obtain the microscopic densities out of the macroscopic 
ones. 

We have extended our previous treatment of the joint density of sojourn times 
and return increments in a more systematic way by using the notion of "cop- 
ula" . We have work in detail the so-called Clayton copula. On the other hand, 
we have introduced into the formalism the overnight effects, which allow us 
to obtain results for times higher than one day. This will allow to treat more 
complex inverse problems. This aspect is under current research. 

We have finally generalized the usual formalism, based on the assumption of 
independent events, to include memory effects. This being motivated by some 
empirical findings on the existence of longer correlations which indicate the 
need for a non-Markovian treatment of the problem. We have presented the 
general setting of a non-Markovian CTRW and in a forthcoming publication 
we will do a deeper treatment and apply the formalism to actual cases. 
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A Derivation of Eq. (22) 

From Eqs (|Z0j) and (|ZIj) . we see that 



A 



h(u) 



1 



(A.l) 



A + s' 



1 + cj 2 / 7 2 ' 
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Since a = 1 then the Clayton copula ([15]) yields 

A 



p{co,s) 



s + A(l + a; 2 /7 2 ) 



(A.2) 



or, after inverting, 



1/2 



p(z,t) =71 — 1 < 



n ^ \ -\t-~/ 2 x 2 /4\t 



(A.3) 



Equation ([19]) now reads 
1 



A + s 



1 + 



uj 2 h 2 + s/A 



(A.4) 



The inverse Fourier transform yields 

1 „, , 7V^ 



s) 



A + S " w ' 2{\ + s)J~s 
Taking into account the following Laplace inversion 

' 1 J\ P -a 2 /4t 



(Trty/ 2 



and the convolution theorem, we finally obtain the following closed expression 
for the return pdf 



p(x, t) = e 



-At 



S(x) + e?-^l^dt 
Jit J 



(A.5) 



which is Eq. $22). 



B Derivation of Eq. (23) 



Since AX and r are random variables representing respectively returns incre- 
ments and sojourn times, we can write p(u, s) in the form (see Eq. (j4j)) 



p(u, s) = (exp{iuAX — sr}). 
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Expanding this average around s = 0, we get 

p{u, s) ~ h{u) - s(re lujAX ) (s -> 0), (B.l) 

where we have taken into account that the characteristic function for jumps, 
h(u), is given by 

h{u) = (e iujAX ). 

On the other hand, if we assume that the sojourn-time density ip(t) has finite 
moments then 

^(s)~1-s(t), (s^O), (B.2) 

where (r) is the mean sojourn time. From Eq. ffTTl) we have 

Po(w,s)~(t), (s-0). (B.3) 



Substituting Eqs. flB~H and flB~3|) into Eq. jTD]) yields 



p(u;, s) ~ = ^ (a -> 0). (B.4) 



By virtue of Tauberian theorems [3 ; 26j, the asymptotic behavior of the char- 
acteristic function p(u>, t) as t — > oo will be given by the inverse Laplace 
transform of Eq. (1B.4P . This inversion reads 

~, \ (r) f [l-ft(w)ltl . v ,„ 

K^0^^^exp|- L— I (t^oo), (B.5) 



which proves Eq. 
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